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Consensus terms for the UBX domain 



YJE8_YEAST KGFLAQ-NYCTLQLKLPNGYTISNTFP-PQTKLHKVRMWL DYNCYDDGTP 

CONSENSUS/ 80% s.t. -t. . . splplRhssGpph . ppF . . spp . lptlhthl s.sss. 

CONSENSUS/ 65% stpptt . stsplQIRhssGpphsppFs . uppslpplhphl . . . .t. . .s.ssss 

CONSENSUS/50% Ppppss . sps+LQIRhPDGpplsccFs . us-sLcplapaV . . . .sptpssusss 

YJE8_YEAST YLFHRNIPRVTLTRNDELKSLQEL-DLLPRSTLILEP 

CONSENSUS/80% apl.pshPch.hspt-. .hph.p. .thhspsslll. . 

CONSENSUS/ 65% FpL . pshP+c . hsps- . .ppl . ch . tlhPsuslllt . 

CONSENSUS/50% FpLpssaPR+slocsD. .KoLp-1 . sLhPsuoLlLct 



Consensus terms for the PUG domain 

AAF74723 EASKLLLTYADNILRNPSDEKYRSIRIGNTAFSTRLLPVRGAVECLFEMGFEEG 

CONSENSUS / 8 0 % tslt.LLphhcNhhpp.p-. . .s. . hpphlh . h . tsh . . h tF .ph 

CONSENSUS/65% pslpsLL+hl+Nhhcc.pE stthpctltpl . sshphhhthtFscl . h. 

CONSENSUS/50% solccLLRhlRNlh+HhcEt tsppl+EplGslP-uhcpaFsssFPcLhh . 



AAF74723 
CONSENSUS/ 80% 
CONSENSUS/ 65% 
CONSENSUS/50% 



— ETHLIFPK 
. . . phh . h . p 
. . .phathhp 
. . .csYplss 
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UBX domain present in SEQ ID NO 2 



339 KAARKKYTKSIIRVQFPDGALLQGVFLPSEPTSALYEFVSAALKEPSLEFELLHPVLVKKRVIPHF 
PAAGERAVTVEEEDLVPAALLKFK 428 



PUG domain present in SEQ ID NO 2 : 



173 GSVEWIKLLKNIVKEPENAKFRKIRMGNPKIKGAIGDWGGVELLEFVGFELKEEGGEIWAVMDV 
PS 240 



UBX domain present in Q9ZU93 



345 KAARKRYKRSMIRVQFPDGWLQGVFAPWEPTFALYEFVSSALKEPSLQFELLDPVLVKRRVIPHT 
P APGQKP I T LE DEE L VP SAL I RFRP 4 35 

PUG domain present in Q9ZU93 

181 SSI DVLLRLFKNI VKEPENAKFRKVRMSNAKI KEAIGDVAGGVELLELVGFELKEENDE IWAVMDV 
PS 248 
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Q9ZU93 
SEQ_ID_ 


_ N0 - 


2 


1 
1 


MDDVKDKLKGFMKKVNLSSSSGKFKGQGRVLGSSSSSSAPPVNPIQNRFN 
1 - 1 : 1 1 1 : 1 1 1 1 1 1 1 . 1 1 1 1 1 1 1 1 i 1 1 1 1 i i . 1 1 1 1 . I . I . : 1 . 1 : 
MGDMKDKVKGFMKKVT-SSSSGKFKGQGRVLGGSSSSG — PSNHV-NNFS 


50 
46 


Q9ZU93 






51 


S SQAPNPTPRPKPNPNPLPEKPLSSSDQKISGSTR NPDHD 

1 . 1 . 1 : . 1 ... | : | | | : | | | : | | 

SHPLNTRQDQQPSYT KTSPQKP-SNSDQRIENICEIQFNKSES 


90 


SEQ_ID_ 


_NO_ 


2 


47 


88 


Q9ZU93 






91 


PVRAPQDGFDPYGAFITSSNRSQNGYSL-SMFECPICKNPFTSEEEVSVH 

: 1 1 1 1 1 : 1 . . : 1 1 . . 1 : . . 1 1 1 1 : : 1 1 1 1 : 1 . : . 1 . I 1 1 1 1 1 . 1 

KDGFDPFGELVTSGKRNPKGYSLTNVFECPVCGSGFVSEEEVSTH 


139 


SEQ_ID_ 


_NO_ 


2 


89 


133 


Q9ZU93 






140 


VESCLG DTN-GDESSFEKDNNDNDKSEMEKLVWYLSGKPSESSIDV 

: : 1 1 1 . . : 1 1 . 1 1 . . 1 . 1 1 1 : 1 - - 1 . - 1 : 1 1 1 i 1 1 . 1 : : 1 
IDSCLSSEVSSNLGVESKVEV KSELETCVSAYVSGKPSEGSVEV 


185 


SEQ_ID_ 


_NO_ 


2 


134 


177 


Q9ZU93 
SEQ_ID_ 


_NO_ 


2 


186 
178 


LLRLFKN I VKE PENAKFRKVRMSNAKI KEAI GDVAGGVELLELVG FE LKE 

::: 1 - 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 - 1 . 1 1 1 . 1 1 1 1 1 . 1 1 1 1 1 1 1 - 1 1 1 1 1 1 1 

VIKLLKNIVKEPENAKFRKIRMGNPKIKGAIGDWGGVELLEFVGFELKE 


235 
227 


Q9ZU93 






236 


ENDEIWAVMDVPSEEQSILINKWGYLEKRKTESSGSSAQV — MEPVAPK 

1 .. 1 I 1 1 1 1 1 i f 1 1 1 1 1 1 .. 1 1 .: 1 . 1 ...!.: 1 1 .111.11 

EGGEIWAVMDVPSEEQLVMLKNWSLLEPKKVEELASLSQVKASEPVEPK 


283 


SEQ_ID_ 


_NO_ 


2 


228 


277 


Q9ZU93 






284 


KIDREIRVFFSVSENVASRIEVPDSFYSLSADEIKREADLRRKKIAESQL 
1 1 1 1 : 1 1 1 ! 1 1 1 - 1 : 1 1 : : 1 1 : 1 1 1 i : : 1 1 - : 1 : : 1 1 1 : : 1 : 1 1 : - : 1 : 1 


333 


SEQ_ID_ 


_NO_ 


2 


278 


KIDRQIRVFFSVPESVAAKIELPDSFFNLSREELRREAEMRKKKLEDSKL 


327 


Q9ZU93 
SEQ_ID_ 


_NO_ 


2 


334 
328 


LI PRS YKEKQAKAARKRYKRSMIRVQFPDGVVIiQGVFAPWEPTFALYEFV 

1 1 1 : 1 1 : 1 1 I 1 1 1 1 1 1 : 1 - : 1 : 1 1 1 1 II 1 1 . : 1 1 1 1 1 - 1 . 1 1 1 . 1 1 1 1 1 1 
LI PKS YREKQAKAARKKYTKSI IRVQFPDGALLQGVFLPSEPTSALYEFV 


383 
377 


Q9ZU93 
SEQ_ID_ 


_NO_ 


2 


384 
378 


SSALJraPSliQFEIJ.DPVLVKRR^ 

1 : 1 1 1 1 1 1 1 : 1 1 1 1 . 1 1 1 1 1 : 1 1 1 1 1 . 1 1 . 1 : : . : 1 : 1 : 1 : 1 1 1 : 1 1 : : 1 
S AALKE P S LE FE LLH PVL VKKRVT PH F P AAGERAVTVEEED L VPAALLKF 


433 
427 


Q9ZU93 






434 


RPIETDSLVFTGLRNELLEISEPLS 4 58 
: 1 1 1 1 1 1 : 1 1 1 1 1 . 1 1 1 1 1 1 1 1 1 1 - 

KPIETDSWFTGLCNELLEISEPLETGSVASS 4 59 




SEQ ID 


NO 


2 


428 
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SEQ_ID_N02 1 MG DMKDKVKG FMKKVT S S S S G KFKGQGRVLGGSSSSGPSNHVNNFSS 4 7 

. I I I I : I . I II I I I I I . I I . I | | ... I I I I I I I - - II 

SEQ_ID_JSI04_C 1 mmkdkmkefmkkvtssgsgtpssf kgtshvlg sgpsps ss 4 0 

SEQ_ID_N02 48 HPL NTRQDQQPSYTKTSPQKPSNSDQRIENICEIQFNKSESKDGFD 93 

I I - | : . : | : . . : | I I . . | : . : 

SEQ_ID_N04_C 41 hpaarssnpspnlrpapkrtspptpptlttdlts 74 

SEQ_ID_N02 94 PFGELVTSGKRNPKGYS LTNVFECPVCGSGFVSEEEVSTHIDSCLSS 140 

1 . . | I |.| I I . I I . . I - I I . . I I - I : I . I I : I 

SEQ_ID__N04_C 75 -f tplvcyssrrpdangtaCavatvacpscgdaf pselavsehldgclas 123 

SEQ_ID_N02 141 EVSSNLGVESKVEVKSELETCVSAYVSG — KPSE GSVEWIKLLKNIVKE 188 

I . . : : . : I I : : . . I - - . I I I I I . : I I ■ I : : : I 

SEQ_ID_N04_C 124 ag garar aaaylaadppppa asvevvkrllgnllre 159 



SEQ_ID_N02 189 PENAKFRKIRMGNPKIKGAIGDWGGVELLEFVGFELKEEGGEIWAVMD - 237 

I . I - I I I : : I : I I I : I I . I : . I . . I I I I I I I . I I I . : . : I I I I - : I I I I 
SEQ_ID_N04_C 160 pgndkf rrvrlgnprikealadrdggvelleavgf tvgdeggepf avmd e 209 

SEQ_ID_N02 238 VPSEEQLVMLKNWSLLE PKKVEELASLSQVKASEPVE-PKKI 279 

I I I :.: I I . I I I I . I . I - - I . - I I : . - I . I • I 

SEQ_ID_N04_C 210 vpsdprlngirravlllegahpsappvkaeaeakescsnvsdvqegakti 259 

SEQ_ID__N02 280 DRQIRVFFSVPESVAAKIELPDSFFNLSREELRREAEMRKKKLEDSKLLI 329 

I I I I I I I . I I I . I . . I : . : : I I I I : . I I - I I : I - I I : I I : : : I I - I : I I I 

SEQ_ID_N04_C 260 drqirvf vsvpgssmaqndvpdsf yklsgeeirneakmrrerleqsrlli 309 

SEQ_ID_N02 330 PKS YREKQJiJakARKKYTKSI IRVQFPDGAI.I<JGVFLPSEPTSAI«YEPVSA 379 

I I I I : I I I I . I I I : I I . : : : I I I I I I I . . : I I I : I I I . I - I I : I I I I I : : 
SEQ_ID_N04_C 310 pksykekqalaarqkykqavirvqfpdrmilqgif lpgeatsslyefvts 359 

SEQ ID N02 380 ALKEPSI1EFEI1LHPVLVKKRVT PHFPAAGERAVTVEEEDLVPAALUCFKP 429 

I I I : . . I I I I I : . I - : . I . I I : I I I I . . I I I I . I : : I I : I I I : I I I I I - I 
SEQ_ID_N04_C 360 alkqsglef elispaipkprwphfpnpgerartlqeeelvpsallkf ip 409 

SEQ_ID_N02 4 30 IETDSWFTGLCNELLEISEPLETGSVASS 459 

. I I I I : I I I I I - : I I I • - I I I I I . 
SEQ_ID__N04_C 410 ketdsmvf tglldellmaseplpaasq 436 



FIGURE 2 D 



WO 2005/059147 



PCT/EP2004/053594 



5/9 



SEQ_ 


_ID_ 


_NO_ 


4 


1 


mmkdkmkefmkkvt — ssgsgtpssf kgtshvlgsgpspssshpaarssn 
I I I : I I I : . 1 : 1 1 1 I 1 - 1 . : - 1 1 1 1 1 1 : 1 1 1 1 1 II - 1 1 1 1 - 1 1 
MMKEKMKDLMRKVTSSSSSSSSSSSFKGTAHVLGSGPDPSS RPSN 


48 


SEQ_ 


_ID_ 


_NO_ 


1 


1 


45 


SEQ_ 


_ID_ 


_NO_ 


4 


49 


pspnlrpapkrtspptpptlttdltsf tplvcyssrr — pdangtaCava 
I : I : ... 1 I : 1 1 1 : - . 1 1 1 1 1 . 1 - . • 1 . . . . 

PTPSRPAAPRREAAASARPPSSGFAPYSPLISTSSRRTDPPAGAGAGEDD 


96 


SEQ_ 


_ID_ 


_NO_ 


7 


46 


95 


SEQ_ 
SEQ_ 


_ID_ 
_ID_ 


_NO_ 
_NO_ 


4 

7 


97 
96 


tvacpscgdafpselavsehldgclasaggararaaaylaadppppaasv 
. 1 1 1 1 1 1 .:. 1 1 1 1 1 1 1 1 : 1 1 1 1 1 M : 1 1 1 1 1 . 1 1 1 1 1 1 1 . 1 1 1 1 :: 1 
AVAC P S CAE P FP S E LAVS D HL DGC L AAAGGARPRAAA YL AG D — PPASAV 


146 
143 


SEQ_ 
SEQ_ 


ID_ 
_ID_ 


_NO_ 
_NO_ 


4 
1 


147 
144 


evvkrllgnllrepgndkf rrvrlgnprikealadrdggvelleavgf tv 

1 1 1 I 1 I 1 1 1 1 1 .: 1 . 1 1 1 : 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 : 1 1 1 1 1 1 1 . 1 

EWKRLLGNLLSDPRNDKYRKVRLGNPRIKEALADREGGVDLLEAVGFRV 


196 
193 


SEQ_ 


_ID_ 


_NO_ 


4 


197 


gdeggepf avmdevpsdprlngirravlllegahpsappvkaeaeakesc 
. 1 1 1 1 1 . 1 1 : 1 1 1 [ 1 . 1 . 1 1 . 1 1 1 : 1 1 1 1 1 1 . 1 . 1 1 - 1 1 : . : 1 : 1 II : 1 
ADEGGELFALMDEVPGDARLGGIRQAVLLLERARPSTPP-QTQADAKETC 


246 


SEQ_ 


XD_ 


_NO_ 


7 


194 


242 


SEQ_ 
SEQ_ 


ID_ 
_ID_ 


_NO_ 
_NO_ 


4 
7 


247 
243 


sn-vsdvqegaktidrqirvfvsvpgssmaqndvpdsf yklsgeeirnea 

.1 I I : - I ... 1 . : 1 1 1 1 1 1 1 - 1 1 . - 1 ! : 1 : 1 1 : 1 1 1 1 1 - 1 1 . 1 1 1 1 1 1 1 

PNGVSEEQGIKKPVDRQIRVFFSVAASSVAENDLPDSFYSLSNEEIRNEA 


295 
292 


SEQ_ 
SEQ_ 


_ID_ 
_ ID _ 


_NO_ 
_NO_ 


4 
7 


296 
293 


kmrrerleqsrllipksykekqalaarqkykqavirvqfpdrmilqgifl 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 : 1 1 1 1 - : 1 1 1 1 : 1 1 
KMRRERLEQSRLLI PKS YKEKQAIAARQKYKQALIRIQFPDGVIIiQGVFI. 


345 
342 


SEQ_ 


_ID_ 


_NO_ 


4 


346 


pgeatsslyefvtsalkqsglefelispaipkprwphfpnpgerartlq 

1 - I . . I 1 1 1 1 1 1 . 1 : 1 1 1 . . 1 1 1 : 1 1 . 1 1 . 1 : - 1 1 : 1 - 1 1 - II 1 : 1 1 1 1 : 
PAEPISSLYEFVASSLKQPSLEFDIilCPAGPRTRVTPPFPKPGEQARTLR 


395 


SEQ_ 


_ID_ 


_NO_ 


_7 


343 


392 


SEQ_ 
SEQ 


_ID_ 
ID_ 


_NO_ 
NO_ 


4 
7 


396 
393 


eeelvpsallkfipketdsmvf tglldellmaseplpaasq 436 

: 1 : 1 1 1 1 1 . 1 . 1 . 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 - - 1 1 1 - 1 1 . 
DEDLVPSARLTFKPKETDSWFTGLLDELLETSEPFTSASS 433 
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SEQ ID NO 1: GRUBX coding sequence 

atgggtgacatgaaggataaagtcaaagggttcatgaaaaaagtcacatcttcttcttcaggtaagttta 
aaggccaaggtagggttttgggtggttcatcttcttcaggaccctcaaatcatgtcaataatttttcatc 
acatcccctaaatacaaggcaagatcaacaaccttcatatacaaaaacttcgcctcaaaaaccaagtaat 
tctgatcaaagaattgagaatatatgtgaaattcagttcaacaaaagtgaatcaaaggatggttttgatc 
catttggtgaattagtcacttctgggaagagaaacccaaaagggtattcacttactaatgtgtttgaatg 
ccctgtctgtggtagtggttttgtttctgaagaagaggtgtcaactcatattgatagctgtttaagttct 
gaagtgtcttctaatttgggagttgaaagtaaagttgaagttaaaagtgaattggaaacatgtgttagtg 
catatgtttcagggaagccctcagaagggtcagttgaagtggtcattaagttgttaaagaatattgtgaa 
ggaaccagagaatgccaagtttaggaaaataaggatggggaatccaaaaataaaaggtgctataggtgat 
gttgtaggaggagtggagctattggaatttgttggatttgagttgaaagaagaaggtggggaaatttggg 
ctgtgatggatgttccttctgaagaacaacttgttatgcttaagaatgtagtttcactcttggaaccgaa 
gaaggttgaagagttggcgtccttatcccaagttaaggcgagtgaaccagttgagccgaagaagattgat 
agacagattcgagtgttcttttctgttcccgagagcgtagcagcaaaaattgagctacctgattccttct 
ttaacctctcacgtgaggaattgagaagagaagcagagatgaggaagaagaaattagaagattccaaatt 
attgattcctaaatcttatcgggaaaagcaggcaaaagctgcaagaaagaagtacacaaaatccattatc 
cgtgtacagtttccagatggagcattgcttcaaggtgtctttctaccttcggagccaactagtgctcttt 
atgagtttgtgagcgcagcgttaaaggaaccaagcttagagttcgaattgttacatccggtgcttgttaa 
aaagcgggtgattccccattttccagctgctggggagagggctgtaacagttgaagaggaggatttggtt 
cctgcagctctactcaaatttaaacctatcgaaacagattctgttgtttttactggtctttgtaatgagc 
ttcttgaaattagcgagcccctcgagaccggatcagttgcttcctcgtaa 

SEQ ID NO 2: GRUBX, deduced protein sequence 

MGDMKDKVKGFMKKVTSSSSGKFKGQGRVLGGSSSSGPSNHVNNFSSHPLNTRQDQQPSYTKTSPQKPSN 
SDQRIENICEIQFNKSESKDGFDPFGELVTSGKRNPKGYSLTNVFECPVCGSGFVSEEEVSTHIDSCLSS 
EVSSNLGVESKVEVKSELETCVSAYVSGKPSEGSVEWIKLLKNIVKEPENAKFRKIRMGNPKIKGAIGD 
WGGVELLE FVGFELKEEGGE I WAVMDVP SEEQLVMLKNWS LLE PKKVEELASL SQVKAS E PVE PKKI D 
RQIRVFFSVPESVAAKIELPDSFFNLSREELRREAEMRKKKLEDSKLLIPKSYREKQAKAARKKYTKSII 
RVQFPDGALLQGVFLPSEPTSALYEFVSAALKEPSLEFELLHPVLVKKRVIPHFPAAGERAVTVEEEDLV 
PAALLKFKPIETDSWFTGLCNELLEISEPLETGSVASS 

SEQ ID NO 3: GRUBX orthologue from sugarcane 

atgatgaaggacaagatgaaggagttcatgaagaaggtcacctcctccgggtccgggaccccctcctcct 
tcaagggcacctcccacgtcctcggctccggcccctccccctcctcctcccaccccgctgcccgctcctc 
aaaccctagcccaaacctcaggcccgctcctaagcggacctcgccacctaccccgcccactttaaccacc 
gatttgacctccttcacgcccctcgtctgctactcctcccgccgccccgacgcgaacggcaccgcgnnng 
ccgtcgccaccgtcgcgtgccccagctgcggagacgcgtttccgtccgagctcgccgtctccgagcatct 
cgacggctgcctcgcgtcggcggggggcgcccgcgcgcgcgccgccgcgtacctcgccgccgacccgcct 
ccgcccgcggcctccgtagaggtggtcaaacgcctgctgggcaacctgctccgggagcccggcaacgata 
agttcaggcgggtgagattgggtaacccgcggatcaaggaggccctggcagacagggatggcggggtgga 
gctcctggaggccgtcggcttcacagttggggatgagggcggggagcccttcgccgtgatggacgaagtg 
cctagcgaccctaggctcaacgggatcaggagggccgtcctcctgctcgagggggcacacccctctgcgc 
ctccagtgaaggcggaggctgaggccaaggagagctgcagcaatgtgtctgacgtgcaggagggtgctaa 
gactattgatcggcagattcgggtatttgtctctgttcctgggagttctatggcacaaaatgatgtacca 
gattctttttacaagcttagtggtgaggagataaggaatgaagcaaagatgaggagggaaaggttagaac 
aatctcgattgctgataccaaagtcttacaaggagaaacaggcattggctgctcgacagaagtataaaca 
agcagtcattcgagttcagtttccagatagaatgattcttcagggcatattcctaccaggagaggccact 
agttcactgtatgagttcgtcacatctgctctgaagcaatcaggtttggaattcgaacttatctctccag 
ccatacctaagccacgtgtggtgccccattttccaaacccgggagagcgggcacgcaccttgcaagagga 
ggagctggtcccatctgcgctcctcaagttcattcccaaggagactgattccatggttttcaccggtttg 
cttgatgagcttctcatggccagtgagccgcttcctgctgcatcacaatga 
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SEQ ID NO 4: sugarcane GRUBX orthologue, deduced protein sequence 

MMKDKMKEFMKKVTSSGSGTPSSFKGTSHVLGSGPSPSSSHPAARSSNPSPNLRPAPKRTSPPTPPTLTT 
DLTSFTPLVCYSSRRPDANGTAXAVATVACPSCGDAFPSELAVSEHLDGCLASAGGARARAAAYLAADPP 
PPAASVEWKRLLGNLLREPGNDKFRRVRLGNPRIKEALADRDGGVELLEAVGFTVGDEGGEPFAVMDEV 

P S DPRLNG I RRAVLLLEGAH P S APPVKAEAE AKE S C SNVS DVQEGAKT I DRQI RVFVSVPG S SMAQNDVP 
DSFYKLSGEEIRNEAKMRRERLEQSRLLIPKSYKEKQALAARQKYKQAVIRVQFPDRMILQGIFLPGEAT 
SSLYEFVTSALKQSGLEFELISPAIPKPRWPHFPNPGERARTLQEEELVPSALLKFIPKETDSMVFTGL 

LDELLMASEPLPAASQ 

SEQ ID NO 5: expression cassette comprising GRUBX (1011-2390) operably 
linked to the prolamine promoter (1-654) and the T-Zein + T-Rubisco 
deltaG terminator (2615-2808 and 2852-3048) : 

cttctacatcggcttaggtgtagcaacacgactttattattattattattattattattattattttaca 
aaaatataaaatagatcagtccctcaccacaagtagagcaagttggtgagttattgtaaagttctacaaa 
gctaatttaaaagttattgcattaacttatttcatattacaaacaagagtgtcaatggaacaatgaaaac 
catatgacatactataattttgtttttattattgaaattatataattcaaagagaataaatccacatagc 
cgtaaagttctacatgtggtgcattaccaaaatatatatagcttacaaaacatgacaagcttagtttgaa 
aaattgcaatccttatcacattgacacataaagtgagtgatgagtcataatattattttctttgctaccc 
atcatgtatatatgatagccacaaagttactttgatgatgatatcaaagaacatttttaggtgcacctaa 
cagaatatccaaataatatgactcacttagatcataatagagcatcaagtaaaactaacactctaaagca 
accgatgggaaagcatctataaatagacaagcacaatgaaaatcctcatcatccttcaccacaattcaaa 
tattatagttgaagcatagtagtaatttaaatcaactagggatatcacaagtttgtacaaaaaagcaggc 
tggtaccggtccggaattcccgggatatcgtcgacccacgcgtccgccaatatcagatttctttcatgaa 
ctccacttccaatttctcattgcttcttcttcccatttccacctccaaagccatccttccagaaaacctt 
gttccttacatttcttagccccaaaaaagattcccatctcaattccacaaaaaaacacaaggagatctaa 
ggaaattccccgcctctatatatagagaggtggaattgttcctgaatttggtttgaattgattgattgac 
agattttggtgagagggtgttattgaaaaaatgggtgacatgaaggataaagtcaaagggttcatgaaaa 
aagtcacatcttcttcttcaggtaagtttaaaggccaaggtagggttttgggtggttcatcttcttcagg 
accctcaaatcatgtcaataatttttcatcacatcccctaaatacaaggcaagatcaacaaccttcatat 
acaaaaacttcgcctcaaaaaccaagtaattctgatcaaagaattgagaatatatgtgaaattcagttca 
acaaaagtgaatcaaaggatggttttgatccatttggtgaattagtcacttctgggaagagaaacccaaa 
agggtattcacttactaatgtgtttgaatgccctgtctgtggtagtggttttgtttctgaagaagaggtg 
tcaactcatattgatagctgtttaagttctgaagtgtcttctaatttgggagttgaaagtaaagttgaag 
ttaaaagtgaattggaaacatgtgttagtgcatatgtttcagggaagccctcagaagggtcagttgaagt 
ggtcattaagttgttaaagaatattgtgaaggaaccagagaatgccaagtttaggaaaataaggatgggg 
aatccaaaaataaaaggtgctataggtgatgttgtaggaggagtggagctattggaatttgttggatttg 
agttgaaagaagaaggtggggaaatttgggctgtgatggatgttccttctgaagaacaacttgttatgct 
taagaatgtagtttcactcttggaaccgaagaaggttgaagagttggcgtccttatcccaagttaaggcg 
agtgaaccagttgagccgaagaagattgatagacagattcgagtgttcttttctgttcccgagagcgtag 
cagcaaaaattgagctacctgattccttctttaacctctcacgtgaggaattgagaagagaagcagagat 
gaggaagaagaaattagaagattccaaattattgattcctaaatcttatcgggaaaagcaggcaaaagct 
gcaagaaagaagtacacaaaatccattatccgtgtacagtttccagatggagcattgcttcaaggtgtct 
ttctaccttcggagccaactagtgctctttatgagtttgtgagcgcagcgttaaaggaaccaagcttaga 
gttcgaattgttacatccggtgcttgttaaaaagcgggtgattccccattttccagctgctggggagagg 
gctgtaacagttgaagaggaggatttggttcctgcagctctactcaaatttaaacctatcgaaacagatt 
ctgttgtttttactggtctttgtaatgagcttcttgaaattagcgagcccctcgagaccggatcagttgc 
ttcctcgtaagctctaaattacatcagactttgaattcttctgagtgttggaaaccttataaaactctct 
gcgccgggaatgctgcggccgctctagagtatccctcgaggggcccaagcttacgcgtacccagctttct 
tgtacaaagtggtgatatcacaagcccgggcggtcttctagggataacagggtaattatatccctctaga 
tcacaagcccgggcggtcttctacgatgattgagtaataatgtgtcacgcatcaccatgggtggcagtgt 
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cagtgtgagcaatgacctgaatgaacaattgaaatgaaaagaaaaaaagtactccatctgttccaaatta 
aaattcattttaaccttttaataggtttatacaataattgatatatgttttctgtatatgtctaatttgt 
tatcatccgggcggtcttctagggataacagggtaattatatccctctagacaacacacaacaaataaga 
gaaaaaacaaataatattaatttgagaatgaacaaaaggaccatatcattcattaactcttctccatcca 
tttccatttcacagttcgatagcgaaaaccgaataaaaaacacagtaaattacaagcacaacaaatggta 
caagaaaaacagttttcccaatgccataatactcgaac 

SEQ ID NO 6: Oryza satlva GRUBX orthologue, DNA sequence 

atgatgaaggaaaagatgaaggatctcatgaggaaggtcacctcctcctcctcctcctcctcgtcgtcct 
cctccttcaagggcaccgcccacgtcctcggctccggccccgacccctcctcccgcccctccaaccctac 
ccctagtcgccccgctgccccccggcgagaggccgccgcctccgcgaggccgccctcctccggcttcgcc 
ccctactccccgctcatctccacctcctcccgccgcaccgacccacccgcgggggcgggggcgggggagg 
acgacgccgtcgcgtgccccagctgcgccgagccgttcccctccgagctggcggtgtcggaccacctcga 
cggctgcctcgcggcggcggggggagcccgcccccgcgcggccgcctacctggccggcgacccccccgcg 
tccgccgtggaggtggtgaagaggctgctcgggaacctgctctccgacccccggaacgacaagtacagga 
aggtcaggctcgggaacccgaggatcaaggaggccctggcggacagggagggcggggtggatctcctcga 
ggccgtggggttcagggtcgccgacgagggcggggagctcttcgccctcatggacgaggtgcccggggac 
gcgaggctcggcggcatcaggcaggccgtgctcctgctcgagagggcccggccatcgacgccgccgcaga 
cacaggcagatgccaaagagacttgcccgaatggagttagcgaagagcaggggattaagaagccggttga 
tcgtcagattcgggtgttcttctctgttgctgcaagttctgttgcagaaaatgatctaccagattctttc 
tatagccttagtaatgaggagatcaggaatgaggcaaagatgaggagggagaggctagaacaatctcggt 
tgttgattccaaagtcatataaggagaagcaggcactggctgcccgacagaagtataaacaagctctgat 
tcgaattcagtttccggatggagtaattctgcagggtgtgttccttcccgcggagcccattagttcacta 
tatgagtttgtcgcatcttctttgaagcagccaagtttggaattcgatcttatctgtccagcgggtccta 
ggacacgcgtcatacctccttttccaaaaccaggggaacaggcacgcacactgcgggatgaagacctagt 
cccttctgctcggctcaccttcaagcccaaagagactgattcggttgtgttcactggtctgttggacgag 
cttcttgagaccagtgagccgttcacatctgcatcctcatga 

SEQ ID NO 7: Oryza satlva GRUBX orthologue, deduced protein sequence 

MMKEKMKDLMRKVTSSSSSSSSSSSFKGTAHVLGSGPDPSSRPSNPTPSRPAAPRREAAASARPPSSGFA 
PYSPLISTSSRRTDPPAGAGAGEDDAVACPSCAEPFPSELAVSDHLDGCLAAAGGARPRAAAYLAGDPPA 
SAVEVVKRLLGNLLSDPRNDKYRKVRLGNPRIKEALADREGGVDLLEAVGFRVADEGGELFALMDEVPGD 

ARLGG IRQAVLLLERARPS T P PQTQADAKETC PNGVSEE QG IKKPVDRQ I RVFFS VAAS SVAENDLP DS F 
YSLSNEEIRNEAKMRRERLEQSRLLIPKSYKEKQALAARQKYKQALIRIQFPDGVILQGVFLPAEPISSL 
YEFVASSLKQPSLEFDLICPAGPRTRVIPPFPKPGEQARTLRDEDLVPSARLTFKPKETDSWFTGLLDE 

LLETSEPFTSASS 



FIGURE 5 (continued) 



